Dataset statistics
| Number of variables | 31 |
|---|---|
| Number of observations | 140000 |
| Missing cells | 1624169 |
| Missing cells (%) | 37.4% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 32.2 MiB |
| Average record size in memory | 241.0 B |
Variable types
| Numeric | 21 |
|---|---|
| Categorical | 4 |
| Boolean | 4 |
| Unsupported | 1 |
| Text | 1 |
alcohol_use is highly overall correlated with cigarette_use and 3 other fields | High correlation |
apgar_1min is highly overall correlated with child_race and 3 other fields | High correlation |
apgar_5min is highly overall correlated with record_weight | High correlation |
born_alive_alive is highly overall correlated with ever_born | High correlation |
child_race is highly overall correlated with apgar_1min and 2 other fields | High correlation |
cigarette_use is highly overall correlated with alcohol_use and 3 other fields | High correlation |
cigarettes_per_day is highly overall correlated with alcohol_use and 2 other fields | High correlation |
drinks_per_week is highly overall correlated with alcohol_use and 2 other fields | High correlation |
ever_born is highly overall correlated with born_alive_alive | High correlation |
father_age is highly overall correlated with mother_married | High correlation |
father_race is highly overall correlated with mother_race | High correlation |
mother_married is highly overall correlated with father_age | High correlation |
mother_race is highly overall correlated with father_race | High correlation |
mother_residence_state is highly overall correlated with state | High correlation |
record_weight is highly overall correlated with alcohol_use and 9 other fields | High correlation |
source_year is highly overall correlated with apgar_1min and 3 other fields | High correlation |
state is highly overall correlated with mother_residence_state | High correlation |
wday is highly overall correlated with record_weight | High correlation |
weight_gain_pounds is highly overall correlated with record_weight | High correlation |
year is highly overall correlated with apgar_1min and 3 other fields | High correlation |
state is highly imbalanced (55.9%) | Imbalance |
plurality is highly imbalanced (90.7%) | Imbalance |
mother_residence_state is highly imbalanced (80.9%) | Imbalance |
cigarette_use is highly imbalanced (53.5%) | Imbalance |
alcohol_use is highly imbalanced (56.4%) | Imbalance |
record_weight is highly imbalanced (89.1%) | Imbalance |
day has 131346 (93.8%) missing values | Missing |
wday has 8654 (6.2%) missing values | Missing |
state has 123104 (87.9%) missing values | Missing |
child_race has 124068 (88.6%) missing values | Missing |
apgar_1min has 127032 (90.7%) missing values | Missing |
apgar_5min has 11933 (8.5%) missing values | Missing |
mother_residence_state has 123104 (87.9%) missing values | Missing |
mother_race has 64325 (45.9%) missing values | Missing |
gestation_weeks has 2301 (1.6%) missing values | Missing |
mother_birth_state has 123747 (88.4%) missing values | Missing |
cigarette_use has 81576 (58.3%) missing values | Missing |
cigarettes_per_day has 134232 (95.9%) missing values | Missing |
alcohol_use has 74321 (53.1%) missing values | Missing |
drinks_per_week has 136403 (97.4%) missing values | Missing |
weight_gain_pounds has 11178 (8.0%) missing values | Missing |
born_alive_alive has 93590 (66.8%) missing values | Missing |
born_alive_dead has 93623 (66.9%) missing values | Missing |
born_dead has 93652 (66.9%) missing values | Missing |
father_race has 64325 (45.9%) missing values | Missing |
apgar_5min is highly skewed (γ1 = 29.33223705) | Skewed |
born_alive_dead is highly skewed (γ1 = 42.83459027) | Skewed |
born_dead is highly skewed (γ1 = 32.97234222) | Skewed |
lmp is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
drinks_per_week has 3335 (2.4%) zeros | Zeros |
born_alive_alive has 18875 (13.5%) zeros | Zeros |
born_alive_dead has 45519 (32.5%) zeros | Zeros |
born_dead has 36177 (25.8%) zeros | Zeros |
Reproduction
| Analysis started | 2024-07-30 18:00:03.123913 |
|---|---|
| Analysis finished | 2024-07-30 18:02:26.682203 |
| Duration | 2 minutes and 23.56 seconds |
| Software version | ydata-profiling vv4.9.0 |
| Download configuration | config.json |
source_year
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 40 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2004.2591 |
| Minimum | 1969 |
|---|---|
| Maximum | 2008 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1969 |
|---|---|
| 5-th percentile | 1985 |
| Q1 | 2005 |
| median | 2006 |
| Q3 | 2007 |
| 95-th percentile | 2008 |
| Maximum | 2008 |
| Range | 39 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 7.0601516 |
|---|---|
| Coefficient of variation (CV) | 0.0035225742 |
| Kurtosis | 9.4412916 |
| Mean | 2004.2591 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -3.1472445 |
| Sum | 2.8059628 × 108 |
| Variance | 49.84574 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2007 | 31305 | |
| 2006 | 31118 | |
| 2008 | 31025 | |
| 2005 | 29656 | |
| 1997 | 548 | 0.4% |
| 1991 | 543 | 0.4% |
| 1998 | 543 | 0.4% |
| 1990 | 540 | 0.4% |
| 1988 | 539 | 0.4% |
| 1980 | 535 | 0.4% |
| Other values (30) | 13648 |
| Value | Count | Frequency (%) |
| 1969 | 380 | |
| 1970 | 402 | |
| 1971 | 276 | |
| 1972 | 236 | |
| 1973 | 232 | |
| 1974 | 234 | |
| 1975 | 244 | |
| 1976 | 421 | |
| 1977 | 532 | |
| 1978 | 519 |
| Value | Count | Frequency (%) |
| 2008 | 31025 | |
| 2007 | 31305 | |
| 2006 | 31118 | |
| 2005 | 29656 | |
| 2004 | 505 | 0.4% |
| 2003 | 459 | 0.3% |
| 2002 | 491 | 0.4% |
| 2001 | 491 | 0.4% |
| 2000 | 520 | 0.4% |
| 1999 | 508 | 0.4% |
year
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 40 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2004.2591 |
| Minimum | 1969 |
|---|---|
| Maximum | 2008 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1969 |
|---|---|
| 5-th percentile | 1985 |
| Q1 | 2005 |
| median | 2006 |
| Q3 | 2007 |
| 95-th percentile | 2008 |
| Maximum | 2008 |
| Range | 39 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 7.0601516 |
|---|---|
| Coefficient of variation (CV) | 0.0035225742 |
| Kurtosis | 9.4412916 |
| Mean | 2004.2591 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -3.1472445 |
| Sum | 2.8059628 × 108 |
| Variance | 49.84574 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2007 | 31305 | |
| 2006 | 31118 | |
| 2008 | 31025 | |
| 2005 | 29656 | |
| 1997 | 548 | 0.4% |
| 1991 | 543 | 0.4% |
| 1998 | 543 | 0.4% |
| 1990 | 540 | 0.4% |
| 1988 | 539 | 0.4% |
| 1980 | 535 | 0.4% |
| Other values (30) | 13648 |
| Value | Count | Frequency (%) |
| 1969 | 380 | |
| 1970 | 402 | |
| 1971 | 276 | |
| 1972 | 236 | |
| 1973 | 232 | |
| 1974 | 234 | |
| 1975 | 244 | |
| 1976 | 421 | |
| 1977 | 532 | |
| 1978 | 519 |
| Value | Count | Frequency (%) |
| 2008 | 31025 | |
| 2007 | 31305 | |
| 2006 | 31118 | |
| 2005 | 29656 | |
| 2004 | 505 | 0.4% |
| 2003 | 459 | 0.3% |
| 2002 | 491 | 0.4% |
| 2001 | 491 | 0.4% |
| 2000 | 520 | 0.4% |
| 1999 | 508 | 0.4% |
month
Real number (ℝ)
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.5510071 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 7 |
| Q3 | 9 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.4225862 |
|---|---|
| Coefficient of variation (CV) | 0.52245191 |
| Kurtosis | -1.186765 |
| Mean | 6.5510071 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.02758067 |
| Sum | 917141 |
| Variance | 11.714096 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8 | 12458 | |
| 7 | 12321 | |
| 9 | 11995 | |
| 6 | 11723 | |
| 5 | 11684 | |
| 3 | 11681 | |
| 10 | 11671 | |
| 12 | 11669 | |
| 1 | 11559 | |
| 11 | 11367 | |
| Other values (2) | 21872 |
| Value | Count | Frequency (%) |
| 1 | 11559 | |
| 2 | 10674 | |
| 3 | 11681 | |
| 4 | 11198 | |
| 5 | 11684 | |
| 6 | 11723 | |
| 7 | 12321 | |
| 8 | 12458 | |
| 9 | 11995 | |
| 10 | 11671 |
| Value | Count | Frequency (%) |
| 12 | 11669 | |
| 11 | 11367 | |
| 10 | 11671 | |
| 9 | 11995 | |
| 8 | 12458 | |
| 7 | 12321 | |
| 6 | 11723 | |
| 5 | 11684 | |
| 4 | 11198 | |
| 3 | 11681 |
day
Real number (ℝ)
MISSING 
| Distinct | 31 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 131346 |
| Missing (%) | 93.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.753062 |
| Minimum | 1 |
|---|---|
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 16 |
| Q3 | 23 |
| 95-th percentile | 30 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.7700393 |
|---|---|
| Coefficient of variation (CV) | 0.55671966 |
| Kurtosis | -1.1853201 |
| Mean | 15.753062 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.019430249 |
| Sum | 136327 |
| Variance | 76.91359 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8 | 321 | 0.2% |
| 14 | 315 | 0.2% |
| 23 | 305 | 0.2% |
| 10 | 304 | 0.2% |
| 20 | 301 | 0.2% |
| 27 | 300 | 0.2% |
| 4 | 298 | 0.2% |
| 13 | 295 | 0.2% |
| 17 | 294 | 0.2% |
| 22 | 292 | 0.2% |
| Other values (21) | 5629 | 4.0% |
| (Missing) | 131346 |
| Value | Count | Frequency (%) |
| 1 | 249 | |
| 2 | 290 | |
| 3 | 283 | |
| 4 | 298 | |
| 5 | 286 | |
| 6 | 249 | |
| 7 | 284 | |
| 8 | 321 | |
| 9 | 274 | |
| 10 | 304 |
| Value | Count | Frequency (%) |
| 31 | 170 | |
| 30 | 278 | |
| 29 | 263 | |
| 28 | 275 | |
| 27 | 300 | |
| 26 | 282 | |
| 25 | 255 | |
| 24 | 256 | |
| 23 | 305 | |
| 22 | 292 |
wday
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 8654 |
| Missing (%) | 6.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.0599257 |
| Minimum | 1 |
|---|---|
| Maximum | 7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 4 |
| Q3 | 6 |
| 95-th percentile | 7 |
| Maximum | 7 |
| Range | 6 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.8304452 |
|---|---|
| Coefficient of variation (CV) | 0.45085683 |
| Kurtosis | -1.1002734 |
| Mean | 4.0599257 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.023394436 |
| Sum | 533255 |
| Variance | 3.3505297 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 21699 | |
| 5 | 21469 | |
| 4 | 21427 | |
| 6 | 20957 | |
| 2 | 19719 | |
| 7 | 13975 | |
| 1 | 12100 | |
| (Missing) | 8654 | 6.2% |
| Value | Count | Frequency (%) |
| 1 | 12100 | |
| 2 | 19719 | |
| 3 | 21699 | |
| 4 | 21427 | |
| 5 | 21469 | |
| 6 | 20957 | |
| 7 | 13975 |
| Value | Count | Frequency (%) |
| 7 | 13975 | |
| 6 | 20957 | |
| 5 | 21469 | |
| 4 | 21427 | |
| 3 | 21699 | |
| 2 | 19719 | |
| 1 | 12100 |
state
Categorical
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 123104 |
| Missing (%) | 87.9% |
| Memory size | 1.1 MiB |
| AL | |
|---|---|
| AK | |
| AR | 271 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 33792 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | AK |
|---|---|
| 2nd row | AK |
| 3rd row | AK |
| 4th row | AK |
| 5th row | AK |
Common Values
| Value | Count | Frequency (%) |
| AL | 14258 | 10.2% |
| AK | 2367 | 1.7% |
| AR | 271 | 0.2% |
| (Missing) | 123104 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| al | 14258 | |
| ak | 2367 | 14.0% |
| ar | 271 | 1.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 16896 | |
| L | 14258 | |
| K | 2367 | 7.0% |
| R | 271 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 33792 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 16896 | |
| L | 14258 | |
| K | 2367 | 7.0% |
| R | 271 | 0.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 33792 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 16896 | |
| L | 14258 | |
| K | 2367 | 7.0% |
| R | 271 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 33792 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 16896 | |
| L | 14258 | |
| K | 2367 | 7.0% |
| R | 271 | 0.8% |
is_male
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 136.8 KiB |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) |
| True | 71546 | |
| False | 68454 |
child_race
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 124068 |
| Missing (%) | 88.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.6471253 |
| Minimum | 1 |
|---|---|
| Maximum | 9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 9 |
| 95-th percentile | 9 |
| Maximum | 9 |
| Range | 8 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 3.50963 |
|---|---|
| Coefficient of variation (CV) | 0.96230037 |
| Kurtosis | -1.2381479 |
| Mean | 3.6471253 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.83154627 |
| Sum | 58106 |
| Variance | 12.317503 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 7318 | 5.2% |
| 9 | 4714 | 3.4% |
| 2 | 3472 | 2.5% |
| 3 | 380 | 0.3% |
| 7 | 23 | < 0.1% |
| 4 | 13 | < 0.1% |
| 5 | 7 | < 0.1% |
| 6 | 5 | < 0.1% |
| (Missing) | 124068 |
| Value | Count | Frequency (%) |
| 1 | 7318 | |
| 2 | 3472 | |
| 3 | 380 | 0.3% |
| 4 | 13 | < 0.1% |
| 5 | 7 | < 0.1% |
| 6 | 5 | < 0.1% |
| 7 | 23 | < 0.1% |
| 9 | 4714 |
| Value | Count | Frequency (%) |
| 9 | 4714 | |
| 7 | 23 | < 0.1% |
| 6 | 5 | < 0.1% |
| 5 | 7 | < 0.1% |
| 4 | 13 | < 0.1% |
| 3 | 380 | 0.3% |
| 2 | 3472 | |
| 1 | 7318 |
weight_pounds
Real number (ℝ)
| Distinct | 3568 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 138 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.2109486 |
| Minimum | 0.50044933 |
|---|---|
| Maximum | 16.459712 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 0.50044933 |
|---|---|
| 5-th percentile | 5.0000841 |
| Q1 | 6.5609569 |
| median | 7.3127332 |
| Q3 | 8.0248263 |
| 95-th percentile | 9.124933 |
| Maximum | 16.459712 |
| Range | 15.959263 |
| Interquartile range (IQR) | 1.4638694 |
Descriptive statistics
| Standard deviation | 1.3211065 |
|---|---|
| Coefficient of variation (CV) | 0.18320842 |
| Kurtosis | 2.8572719 |
| Mean | 7.2109486 |
| Median Absolute Deviation (MAD) | 0.74957169 |
| Skewness | -0.89931276 |
| Sum | 1008537.7 |
| Variance | 1.7453224 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7.374462664 | 2246 | 1.6% |
| 7.561855587 | 2200 | 1.6% |
| 6.999676819 | 2187 | 1.6% |
| 7.187069741 | 2186 | 1.6% |
| 7.500126153 | 2051 | 1.5% |
| 7.749248509 | 1960 | 1.4% |
| 7.251003797 | 1901 | 1.4% |
| 7.312733231 | 1898 | 1.4% |
| 7.687519076 | 1888 | 1.3% |
| 7.125340308 | 1868 | 1.3% |
| Other values (3558) | 119477 |
| Value | Count | Frequency (%) |
| 0.5004493347 | 8 | |
| 0.5070632026 | 2 | < 0.1% |
| 0.5092678252 | 1 | < 0.1% |
| 0.5158816931 | 1 | < 0.1% |
| 0.5180863157 | 1 | < 0.1% |
| 0.5291094288 | 1 | < 0.1% |
| 0.5401325419 | 1 | < 0.1% |
| 0.5445417871 | 1 | < 0.1% |
| 0.5599741455 | 1 | < 0.1% |
| 0.5621787681 | 6 |
| Value | Count | Frequency (%) |
| 16.45971248 | 1 | |
| 15.62636513 | 1 | |
| 14.36752561 | 1 | |
| 13.81196071 | 1 | |
| 13.75023128 | 1 | |
| 13.56945223 | 1 | |
| 13.31151138 | 1 | |
| 13.00065959 | 1 | |
| 12.86397299 | 1 | |
| 12.85956374 | 1 |
plurality
Categorical
IMBALANCE 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 782 |
| Missing (%) | 0.6% |
| Memory size | 1.1 MiB |
| 1.0 | |
|---|---|
| 2.0 | 4356 |
| 3.0 | 187 |
| 4.0 | 10 |
| 5.0 | 2 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 417654 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 2.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 134663 | |
| 2.0 | 4356 | 3.1% |
| 3.0 | 187 | 0.1% |
| 4.0 | 10 | < 0.1% |
| 5.0 | 2 | < 0.1% |
| (Missing) | 782 | 0.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 134663 | |
| 2.0 | 4356 | 3.1% |
| 3.0 | 187 | 0.1% |
| 4.0 | 10 | < 0.1% |
| 5.0 | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 139218 | |
| 0 | 139218 | |
| 1 | 134663 | |
| 2 | 4356 | 1.0% |
| 3 | 187 | < 0.1% |
| 4 | 10 | < 0.1% |
| 5 | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 417654 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| . | 139218 | |
| 0 | 139218 | |
| 1 | 134663 | |
| 2 | 4356 | 1.0% |
| 3 | 187 | < 0.1% |
| 4 | 10 | < 0.1% |
| 5 | 2 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 417654 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| . | 139218 | |
| 0 | 139218 | |
| 1 | 134663 | |
| 2 | 4356 | 1.0% |
| 3 | 187 | < 0.1% |
| 4 | 10 | < 0.1% |
| 5 | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 417654 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| . | 139218 | |
| 0 | 139218 | |
| 1 | 134663 | |
| 2 | 4356 | 1.0% |
| 3 | 187 | < 0.1% |
| 4 | 10 | < 0.1% |
| 5 | 2 | < 0.1% |
apgar_1min
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 11 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 127032 |
| Missing (%) | 90.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.750463 |
| Minimum | 1 |
|---|---|
| Maximum | 99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 8 |
| median | 9 |
| Q3 | 99 |
| 95-th percentile | 99 |
| Maximum | 99 |
| Range | 98 |
| Interquartile range (IQR) | 91 |
Descriptive statistics
| Standard deviation | 42.63343 |
|---|---|
| Coefficient of variation (CV) | 1.1293485 |
| Kurtosis | -1.4502484 |
| Mean | 37.750463 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.73906407 |
| Sum | 489548 |
| Variance | 1817.6094 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 99 | 4230 | 3.0% |
| 9 | 3850 | 2.8% |
| 8 | 3082 | 2.2% |
| 7 | 733 | 0.5% |
| 6 | 291 | 0.2% |
| 10 | 291 | 0.2% |
| 5 | 153 | 0.1% |
| 4 | 119 | 0.1% |
| 3 | 80 | 0.1% |
| 1 | 74 | 0.1% |
| (Missing) | 127032 |
| Value | Count | Frequency (%) |
| 1 | 74 | 0.1% |
| 2 | 65 | < 0.1% |
| 3 | 80 | 0.1% |
| 4 | 119 | 0.1% |
| 5 | 153 | 0.1% |
| 6 | 291 | 0.2% |
| 7 | 733 | 0.5% |
| 8 | 3082 | |
| 9 | 3850 | |
| 10 | 291 | 0.2% |
| Value | Count | Frequency (%) |
| 99 | 4230 | |
| 10 | 291 | 0.2% |
| 9 | 3850 | |
| 8 | 3082 | |
| 7 | 733 | 0.5% |
| 6 | 291 | 0.2% |
| 5 | 153 | 0.1% |
| 4 | 119 | 0.1% |
| 3 | 80 | 0.1% |
| 2 | 65 | < 0.1% |
apgar_5min
Real number (ℝ)
HIGH CORRELATION  MISSING  SKEWED 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 11933 |
| Missing (%) | 8.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.9421162 |
| Minimum | 0 |
|---|---|
| Maximum | 99 |
| Zeros | 60 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 8 |
| Q1 | 9 |
| median | 9 |
| Q3 | 9 |
| 95-th percentile | 10 |
| Maximum | 99 |
| Range | 99 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2.8132854 |
|---|---|
| Coefficient of variation (CV) | 0.31461069 |
| Kurtosis | 940.30931 |
| Mean | 8.9421162 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 29.332237 |
| Sum | 1145190 |
| Variance | 7.9145746 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9 | 104817 | |
| 8 | 11397 | 8.1% |
| 10 | 7626 | 5.4% |
| 7 | 2068 | 1.5% |
| 6 | 778 | 0.6% |
| 5 | 376 | 0.3% |
| 1 | 274 | 0.2% |
| 4 | 221 | 0.2% |
| 2 | 171 | 0.1% |
| 3 | 164 | 0.1% |
| Other values (2) | 175 | 0.1% |
| (Missing) | 11933 | 8.5% |
| Value | Count | Frequency (%) |
| 0 | 60 | < 0.1% |
| 1 | 274 | 0.2% |
| 2 | 171 | 0.1% |
| 3 | 164 | 0.1% |
| 4 | 221 | 0.2% |
| 5 | 376 | 0.3% |
| 6 | 778 | 0.6% |
| 7 | 2068 | 1.5% |
| 8 | 11397 | 8.1% |
| 9 | 104817 |
| Value | Count | Frequency (%) |
| 99 | 115 | 0.1% |
| 10 | 7626 | 5.4% |
| 9 | 104817 | |
| 8 | 11397 | 8.1% |
| 7 | 2068 | 1.5% |
| 6 | 778 | 0.6% |
| 5 | 376 | 0.3% |
| 4 | 221 | 0.2% |
| 3 | 164 | 0.1% |
| 2 | 171 | 0.1% |
mother_residence_state
Categorical
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 23 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 123104 |
| Missing (%) | 87.9% |
| Memory size | 1.1 MiB |
| AL | |
|---|---|
| AK | |
| AR | 263 |
| FL | 87 |
| MS | 64 |
| Other values (18) | 147 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 33792 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 11 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | AK |
|---|---|
| 2nd row | AK |
| 3rd row | AK |
| 4th row | AK |
| 5th row | AK |
Common Values
| Value | Count | Frequency (%) |
| AL | 13973 | 10.0% |
| AK | 2362 | 1.7% |
| AR | 263 | 0.2% |
| FL | 87 | 0.1% |
| MS | 64 | < 0.1% |
| GA | 61 | < 0.1% |
| TN | 61 | < 0.1% |
| OK | 6 | < 0.1% |
| NC | 2 | < 0.1% |
| TX | 2 | < 0.1% |
| Other values (13) | 15 | < 0.1% |
| (Missing) | 123104 |
Length
| Value | Count | Frequency (%) |
| al | 13973 | |
| ak | 2362 | 14.0% |
| ar | 263 | 1.6% |
| fl | 87 | 0.5% |
| ms | 64 | 0.4% |
| ga | 61 | 0.4% |
| tn | 61 | 0.4% |
| ok | 6 | < 0.1% |
| nc | 2 | < 0.1% |
| tx | 2 | < 0.1% |
| Other values (13) | 15 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 16662 | |
| L | 14061 | |
| K | 2369 | 7.0% |
| R | 263 | 0.8% |
| F | 87 | 0.3% |
| M | 69 | 0.2% |
| S | 67 | 0.2% |
| N | 66 | 0.2% |
| T | 63 | 0.2% |
| G | 61 | 0.2% |
| Other values (8) | 24 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 33792 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 16662 | |
| L | 14061 | |
| K | 2369 | 7.0% |
| R | 263 | 0.8% |
| F | 87 | 0.3% |
| M | 69 | 0.2% |
| S | 67 | 0.2% |
| N | 66 | 0.2% |
| T | 63 | 0.2% |
| G | 61 | 0.2% |
| Other values (8) | 24 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 33792 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 16662 | |
| L | 14061 | |
| K | 2369 | 7.0% |
| R | 263 | 0.8% |
| F | 87 | 0.3% |
| M | 69 | 0.2% |
| S | 67 | 0.2% |
| N | 66 | 0.2% |
| T | 63 | 0.2% |
| G | 61 | 0.2% |
| Other values (8) | 24 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 33792 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 16662 | |
| L | 14061 | |
| K | 2369 | 7.0% |
| R | 263 | 0.8% |
| F | 87 | 0.3% |
| M | 69 | 0.2% |
| S | 67 | 0.2% |
| N | 66 | 0.2% |
| T | 63 | 0.2% |
| G | 61 | 0.2% |
| Other values (8) | 24 | 0.1% |
mother_race
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 64325 |
| Missing (%) | 45.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.8684638 |
| Minimum | 1 |
|---|---|
| Maximum | 78 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 78 |
| Range | 77 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 9.9756692 |
|---|---|
| Coefficient of variation (CV) | 3.4777044 |
| Kurtosis | 45.862648 |
| Mean | 2.8684638 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.7918093 |
| Sum | 217071 |
| Variance | 99.513977 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 55327 | |
| 2 | 15424 | 11.0% |
| 3 | 1452 | 1.0% |
| 78 | 912 | 0.7% |
| 4 | 555 | 0.4% |
| 18 | 554 | 0.4% |
| 7 | 474 | 0.3% |
| 68 | 376 | 0.3% |
| 48 | 152 | 0.1% |
| 28 | 150 | 0.1% |
| Other values (5) | 299 | 0.2% |
| (Missing) | 64325 |
| Value | Count | Frequency (%) |
| 1 | 55327 | |
| 2 | 15424 | 11.0% |
| 3 | 1452 | 1.0% |
| 4 | 555 | 0.4% |
| 5 | 141 | 0.1% |
| 6 | 34 | < 0.1% |
| 7 | 474 | 0.3% |
| 9 | 99 | 0.1% |
| 18 | 554 | 0.4% |
| 28 | 150 | 0.1% |
| Value | Count | Frequency (%) |
| 78 | 912 | |
| 68 | 376 | |
| 58 | 4 | < 0.1% |
| 48 | 152 | 0.1% |
| 38 | 21 | < 0.1% |
| 28 | 150 | 0.1% |
| 18 | 554 | |
| 9 | 99 | 0.1% |
| 7 | 474 | |
| 6 | 34 | < 0.1% |
mother_age
Real number (ℝ)
| Distinct | 39 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.124679 |
| Minimum | 12 |
|---|---|
| Maximum | 50 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 12 |
|---|---|
| 5-th percentile | 18 |
| Q1 | 22 |
| median | 27 |
| Q3 | 32 |
| 95-th percentile | 38 |
| Maximum | 50 |
| Range | 38 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 6.1616839 |
|---|---|
| Coefficient of variation (CV) | 0.22716154 |
| Kurtosis | -0.56389428 |
| Mean | 27.124679 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 0.27388388 |
| Sum | 3797455 |
| Variance | 37.966348 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 26 | 8013 | 5.7% |
| 27 | 7856 | 5.6% |
| 28 | 7779 | 5.6% |
| 24 | 7756 | 5.5% |
| 25 | 7707 | 5.5% |
| 23 | 7566 | 5.4% |
| 29 | 7485 | 5.3% |
| 22 | 7309 | 5.2% |
| 30 | 7262 | 5.2% |
| 21 | 7041 | 5.0% |
| Other values (29) | 64226 |
| Value | Count | Frequency (%) |
| 12 | 7 | < 0.1% |
| 13 | 46 | < 0.1% |
| 14 | 240 | 0.2% |
| 15 | 704 | 0.5% |
| 16 | 1593 | 1.1% |
| 17 | 2822 | |
| 18 | 4373 | |
| 19 | 5895 | |
| 20 | 6738 | |
| 21 | 7041 |
| Value | Count | Frequency (%) |
| 50 | 14 | < 0.1% |
| 49 | 8 | < 0.1% |
| 48 | 6 | < 0.1% |
| 47 | 32 | < 0.1% |
| 46 | 50 | < 0.1% |
| 45 | 90 | 0.1% |
| 44 | 177 | 0.1% |
| 43 | 373 | |
| 42 | 586 | |
| 41 | 855 |
gestation_weeks
Real number (ℝ)
MISSING 
| Distinct | 37 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2301 |
| Missing (%) | 1.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38.90883 |
| Minimum | 17 |
|---|---|
| Maximum | 99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 17 |
|---|---|
| 5-th percentile | 34 |
| Q1 | 38 |
| median | 39 |
| Q3 | 40 |
| 95-th percentile | 42 |
| Maximum | 99 |
| Range | 82 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 5.0584691 |
|---|---|
| Coefficient of variation (CV) | 0.13000825 |
| Kurtosis | 101.2077 |
| Mean | 38.90883 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 8.3871079 |
| Sum | 5357707 |
| Variance | 25.588109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 39 | 34983 | |
| 40 | 26196 | |
| 38 | 25540 | |
| 37 | 12368 | 8.8% |
| 41 | 12140 | 8.7% |
| 36 | 6249 | 4.5% |
| 42 | 4280 | 3.1% |
| 35 | 3627 | 2.6% |
| 34 | 2229 | 1.6% |
| 43 | 2162 | 1.5% |
| Other values (27) | 7925 | 5.7% |
| (Missing) | 2301 | 1.6% |
| Value | Count | Frequency (%) |
| 17 | 16 | < 0.1% |
| 18 | 20 | < 0.1% |
| 19 | 33 | < 0.1% |
| 20 | 42 | < 0.1% |
| 21 | 64 | < 0.1% |
| 22 | 77 | |
| 23 | 120 | |
| 24 | 140 | |
| 25 | 147 | |
| 26 | 192 |
| Value | Count | Frequency (%) |
| 99 | 714 | |
| 52 | 5 | < 0.1% |
| 51 | 5 | < 0.1% |
| 50 | 13 | < 0.1% |
| 49 | 12 | < 0.1% |
| 48 | 10 | < 0.1% |
| 47 | 197 | 0.1% |
| 46 | 302 | 0.2% |
| 45 | 564 | |
| 44 | 1111 |
lmp
Unsupported
REJECTED  UNSUPPORTED 
| Missing | 0 |
|---|---|
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
mother_married
Boolean
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 12 |
| Missing (%) | < 0.1% |
| Memory size | 1.1 MiB |
| True | |
|---|---|
| False | |
| (Missing) | 12 |
| Value | Count | Frequency (%) |
| True | 87811 | |
| False | 52177 | |
| (Missing) | 12 | < 0.1% |
MISSING 
| Distinct | 61 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 123747 |
| Missing (%) | 88.4% |
| Memory size | 1.1 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 2 |
| Mean length | 2.14182 |
| Min length | 2 |
Characters and Unicode
| Total characters | 34811 |
|---|---|
| Distinct characters | 36 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | AK |
|---|---|
| 2nd row | IL |
| 3rd row | MO |
| 4th row | PA |
| 5th row | NE |
| Value | Count | Frequency (%) |
| al | 9783 | |
| ak | 740 | 4.6% |
| ga | 429 | 2.6% |
| fl | 423 | 2.6% |
| foreign | 383 | 2.4% |
| ca | 349 | 2.1% |
| ms | 317 | 2.0% |
| tn | 280 | 1.7% |
| tx | 267 | 1.6% |
| il | 261 | 1.6% |
| Other values (51) | 3021 | 18.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 12104 | |
| L | 10612 | |
| K | 964 | 2.8% |
| N | 940 | 2.7% |
| M | 926 | 2.7% |
| F | 806 | 2.3% |
| I | 769 | 2.2% |
| C | 679 | 2.0% |
| T | 666 | 1.9% |
| O | 517 | 1.5% |
| Other values (26) | 5828 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 34811 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 12104 | |
| L | 10612 | |
| K | 964 | 2.8% |
| N | 940 | 2.7% |
| M | 926 | 2.7% |
| F | 806 | 2.3% |
| I | 769 | 2.2% |
| C | 679 | 2.0% |
| T | 666 | 1.9% |
| O | 517 | 1.5% |
| Other values (26) | 5828 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 34811 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 12104 | |
| L | 10612 | |
| K | 964 | 2.8% |
| N | 940 | 2.7% |
| M | 926 | 2.7% |
| F | 806 | 2.3% |
| I | 769 | 2.2% |
| C | 679 | 2.0% |
| T | 666 | 1.9% |
| O | 517 | 1.5% |
| Other values (26) | 5828 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 34811 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 12104 | |
| L | 10612 | |
| K | 964 | 2.8% |
| N | 940 | 2.7% |
| M | 926 | 2.7% |
| F | 806 | 2.3% |
| I | 769 | 2.2% |
| C | 679 | 2.0% |
| T | 666 | 1.9% |
| O | 517 | 1.5% |
| Other values (26) | 5828 |
cigarette_use
Boolean
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 81576 |
| Missing (%) | 58.3% |
| Memory size | 1.1 MiB |
| False | |
|---|---|
| True | 5768 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 52656 | |
| True | 5768 | 4.1% |
| (Missing) | 81576 |
cigarettes_per_day
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 32 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 134232 |
| Missing (%) | 95.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.087205 |
| Minimum | 1 |
|---|---|
| Maximum | 99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 5 |
| median | 10 |
| Q3 | 10.25 |
| 95-th percentile | 99 |
| Maximum | 99 |
| Range | 98 |
| Interquartile range (IQR) | 5.25 |
Descriptive statistics
| Standard deviation | 23.312643 |
|---|---|
| Coefficient of variation (CV) | 1.5451929 |
| Kurtosis | 8.3398944 |
| Mean | 15.087205 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 3.0950497 |
| Sum | 87023 |
| Variance | 543.4793 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 1746 | 1.2% |
| 5 | 707 | 0.5% |
| 20 | 671 | 0.5% |
| 3 | 394 | 0.3% |
| 99 | 388 | 0.3% |
| 2 | 361 | 0.3% |
| 4 | 312 | 0.2% |
| 6 | 246 | 0.2% |
| 1 | 231 | 0.2% |
| 15 | 176 | 0.1% |
| Other values (22) | 536 | 0.4% |
| (Missing) | 134232 |
| Value | Count | Frequency (%) |
| 1 | 231 | 0.2% |
| 2 | 361 | 0.3% |
| 3 | 394 | 0.3% |
| 4 | 312 | 0.2% |
| 5 | 707 | |
| 6 | 246 | 0.2% |
| 7 | 170 | 0.1% |
| 8 | 130 | 0.1% |
| 9 | 29 | < 0.1% |
| 10 | 1746 |
| Value | Count | Frequency (%) |
| 99 | 388 | |
| 60 | 2 | < 0.1% |
| 46 | 1 | < 0.1% |
| 40 | 15 | < 0.1% |
| 36 | 1 | < 0.1% |
| 35 | 1 | < 0.1% |
| 30 | 36 | < 0.1% |
| 27 | 2 | < 0.1% |
| 25 | 8 | < 0.1% |
| 24 | 2 | < 0.1% |
alcohol_use
Boolean
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 74321 |
| Missing (%) | 53.1% |
| Memory size | 1.1 MiB |
| False | |
|---|---|
| True | 5897 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 59782 | |
| True | 5897 | 4.2% |
| (Missing) | 74321 |
drinks_per_week
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 14 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 136403 |
| Missing (%) | 97.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.2749513 |
| Minimum | 0 |
|---|---|
| Maximum | 99 |
| Zeros | 3335 |
| Zeros (%) | 2.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 99 |
| Range | 99 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 17.424963 |
|---|---|
| Coefficient of variation (CV) | 5.3206784 |
| Kurtosis | 26.180972 |
| Mean | 3.2749513 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.3019109 |
| Sum | 11780 |
| Variance | 303.62933 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3335 | 2.4% |
| 99 | 115 | 0.1% |
| 1 | 72 | 0.1% |
| 2 | 36 | < 0.1% |
| 6 | 11 | < 0.1% |
| 3 | 8 | < 0.1% |
| 5 | 6 | < 0.1% |
| 4 | 5 | < 0.1% |
| 7 | 4 | < 0.1% |
| 8 | 1 | < 0.1% |
| Other values (4) | 4 | < 0.1% |
| (Missing) | 136403 |
| Value | Count | Frequency (%) |
| 0 | 3335 | |
| 1 | 72 | 0.1% |
| 2 | 36 | < 0.1% |
| 3 | 8 | < 0.1% |
| 4 | 5 | < 0.1% |
| 5 | 6 | < 0.1% |
| 6 | 11 | < 0.1% |
| 7 | 4 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 99 | 115 | |
| 42 | 1 | < 0.1% |
| 14 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 7 | 4 | < 0.1% |
| 6 | 11 | < 0.1% |
| 5 | 6 | < 0.1% |
| 4 | 5 | < 0.1% |
weight_gain_pounds
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 99 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 11178 |
| Missing (%) | 8.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 39.383785 |
| Minimum | 1 |
|---|---|
| Maximum | 99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 11 |
| Q1 | 23 |
| median | 32 |
| Q3 | 45 |
| 95-th percentile | 99 |
| Maximum | 99 |
| Range | 98 |
| Interquartile range (IQR) | 22 |
Descriptive statistics
| Standard deviation | 25.401934 |
|---|---|
| Coefficient of variation (CV) | 0.64498457 |
| Kurtosis | 0.96154864 |
| Mean | 39.383785 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 1.352962 |
| Sum | 5073498 |
| Variance | 645.25824 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 99 | 15247 | 10.9% |
| 30 | 8910 | 6.4% |
| 20 | 6180 | 4.4% |
| 25 | 5965 | 4.3% |
| 40 | 5867 | 4.2% |
| 35 | 5242 | 3.7% |
| 32 | 3002 | 2.1% |
| 50 | 2866 | 2.0% |
| 28 | 2859 | 2.0% |
| 26 | 2686 | 1.9% |
| Other values (89) | 69998 | |
| (Missing) | 11178 | 8.0% |
| Value | Count | Frequency (%) |
| 1 | 249 | 0.2% |
| 2 | 330 | 0.2% |
| 3 | 330 | 0.2% |
| 4 | 387 | 0.3% |
| 5 | 614 | 0.4% |
| 6 | 488 | 0.3% |
| 7 | 627 | 0.4% |
| 8 | 678 | 0.5% |
| 9 | 586 | 0.4% |
| 10 | 1898 |
| Value | Count | Frequency (%) |
| 99 | 15247 | |
| 98 | 135 | 0.1% |
| 97 | 5 | < 0.1% |
| 96 | 4 | < 0.1% |
| 95 | 19 | < 0.1% |
| 94 | 10 | < 0.1% |
| 93 | 13 | < 0.1% |
| 92 | 6 | < 0.1% |
| 91 | 7 | < 0.1% |
| 90 | 51 | < 0.1% |
born_alive_alive
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 18 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 93590 |
| Missing (%) | 66.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.054471 |
| Minimum | 0 |
|---|---|
| Maximum | 77 |
| Zeros | 18875 |
| Zeros (%) | 13.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 77 |
| Range | 77 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.5214944 |
|---|---|
| Coefficient of variation (CV) | 1.4428983 |
| Kurtosis | 743.95084 |
| Mean | 1.054471 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 16.641932 |
| Sum | 48938 |
| Variance | 2.3149453 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 18875 | 13.5% |
| 1 | 15202 | 10.9% |
| 2 | 7482 | 5.3% |
| 3 | 2912 | 2.1% |
| 4 | 1058 | 0.8% |
| 5 | 438 | 0.3% |
| 6 | 215 | 0.2% |
| 7 | 110 | 0.1% |
| 8 | 41 | < 0.1% |
| 9 | 34 | < 0.1% |
| Other values (8) | 43 | < 0.1% |
| (Missing) | 93590 |
| Value | Count | Frequency (%) |
| 0 | 18875 | |
| 1 | 15202 | |
| 2 | 7482 | 5.3% |
| 3 | 2912 | 2.1% |
| 4 | 1058 | 0.8% |
| 5 | 438 | 0.3% |
| 6 | 215 | 0.2% |
| 7 | 110 | 0.1% |
| 8 | 41 | < 0.1% |
| 9 | 34 | < 0.1% |
| Value | Count | Frequency (%) |
| 77 | 5 | < 0.1% |
| 55 | 2 | < 0.1% |
| 35 | 1 | < 0.1% |
| 14 | 1 | < 0.1% |
| 13 | 5 | < 0.1% |
| 12 | 6 | < 0.1% |
| 11 | 9 | < 0.1% |
| 10 | 14 | < 0.1% |
| 9 | 34 | |
| 8 | 41 |
born_alive_dead
Real number (ℝ)
MISSING  SKEWED  ZEROS 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 93623 |
| Missing (%) | 66.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.060072881 |
| Minimum | 0 |
|---|---|
| Maximum | 77 |
| Zeros | 45519 |
| Zeros (%) | 32.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 77 |
| Range | 77 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.6336883 |
|---|---|
| Coefficient of variation (CV) | 27.195104 |
| Kurtosis | 1891.9174 |
| Mean | 0.060072881 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 42.83459 |
| Sum | 2786 |
| Variance | 2.6689373 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 45519 | |
| 1 | 707 | 0.5% |
| 2 | 91 | 0.1% |
| 3 | 21 | < 0.1% |
| 77 | 15 | < 0.1% |
| 55 | 11 | < 0.1% |
| 4 | 5 | < 0.1% |
| 5 | 3 | < 0.1% |
| 6 | 2 | < 0.1% |
| 10 | 1 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
| (Missing) | 93623 |
| Value | Count | Frequency (%) |
| 0 | 45519 | |
| 1 | 707 | 0.5% |
| 2 | 91 | 0.1% |
| 3 | 21 | < 0.1% |
| 4 | 5 | < 0.1% |
| 5 | 3 | < 0.1% |
| 6 | 2 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 77 | 15 | < 0.1% |
| 55 | 11 | < 0.1% |
| 10 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 6 | 2 | < 0.1% |
| 5 | 3 | < 0.1% |
| 4 | 5 | < 0.1% |
| 3 | 21 | < 0.1% |
| 2 | 91 |
born_dead
Real number (ℝ)
MISSING  SKEWED  ZEROS 
| Distinct | 16 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 93652 |
| Missing (%) | 66.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.36066281 |
| Minimum | 0 |
|---|---|
| Maximum | 77 |
| Zeros | 36177 |
| Zeros (%) | 25.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 77 |
| Range | 77 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.7798383 |
|---|---|
| Coefficient of variation (CV) | 4.9349095 |
| Kurtosis | 1321.5141 |
| Mean | 0.36066281 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 32.972342 |
| Sum | 16716 |
| Variance | 3.1678245 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 36177 | 25.8% |
| 1 | 7052 | 5.0% |
| 2 | 2029 | 1.4% |
| 3 | 693 | 0.5% |
| 4 | 227 | 0.2% |
| 5 | 82 | 0.1% |
| 6 | 30 | < 0.1% |
| 7 | 21 | < 0.1% |
| 77 | 15 | < 0.1% |
| 55 | 11 | < 0.1% |
| Other values (6) | 11 | < 0.1% |
| (Missing) | 93652 |
| Value | Count | Frequency (%) |
| 0 | 36177 | |
| 1 | 7052 | 5.0% |
| 2 | 2029 | 1.4% |
| 3 | 693 | 0.5% |
| 4 | 227 | 0.2% |
| 5 | 82 | 0.1% |
| 6 | 30 | < 0.1% |
| 7 | 21 | < 0.1% |
| 8 | 3 | < 0.1% |
| 10 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 77 | 15 | |
| 55 | 11 | < 0.1% |
| 18 | 1 | < 0.1% |
| 14 | 1 | < 0.1% |
| 12 | 2 | < 0.1% |
| 11 | 2 | < 0.1% |
| 10 | 2 | < 0.1% |
| 8 | 3 | < 0.1% |
| 7 | 21 | |
| 6 | 30 |
ever_born
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 723 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.0813989 |
| Minimum | 1 |
|---|---|
| Maximum | 15 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 4 |
| Maximum | 15 |
| Range | 14 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.2556892 |
|---|---|
| Coefficient of variation (CV) | 0.60329098 |
| Kurtosis | 5.3525575 |
| Mean | 2.0813989 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.7741727 |
| Sum | 289891 |
| Variance | 1.5767554 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 55565 | |
| 2 | 44461 | |
| 3 | 23254 | |
| 4 | 9588 | 6.8% |
| 5 | 3613 | 2.6% |
| 6 | 1446 | 1.0% |
| 7 | 659 | 0.5% |
| 8 | 530 | 0.4% |
| 9 | 54 | < 0.1% |
| 10 | 44 | < 0.1% |
| Other values (5) | 63 | < 0.1% |
| (Missing) | 723 | 0.5% |
| Value | Count | Frequency (%) |
| 1 | 55565 | |
| 2 | 44461 | |
| 3 | 23254 | |
| 4 | 9588 | 6.8% |
| 5 | 3613 | 2.6% |
| 6 | 1446 | 1.0% |
| 7 | 659 | 0.5% |
| 8 | 530 | 0.4% |
| 9 | 54 | < 0.1% |
| 10 | 44 | < 0.1% |
| Value | Count | Frequency (%) |
| 15 | 4 | < 0.1% |
| 14 | 8 | < 0.1% |
| 13 | 12 | < 0.1% |
| 12 | 13 | < 0.1% |
| 11 | 26 | < 0.1% |
| 10 | 44 | < 0.1% |
| 9 | 54 | < 0.1% |
| 8 | 530 | 0.4% |
| 7 | 659 | |
| 6 | 1446 |
father_race
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 16 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 64325 |
| Missing (%) | 45.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.978077 |
| Minimum | 1 |
|---|---|
| Maximum | 99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 99 |
| Maximum | 99 |
| Range | 98 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 33.844786 |
|---|---|
| Coefficient of variation (CV) | 2.1182014 |
| Kurtosis | 1.9857813 |
| Mean | 15.978077 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.9781829 |
| Sum | 1209141 |
| Variance | 1145.4695 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 48471 | |
| 99 | 10056 | 7.2% |
| 2 | 9783 | 7.0% |
| 9 | 3677 | 2.6% |
| 3 | 887 | 0.6% |
| 78 | 789 | 0.6% |
| 18 | 526 | 0.4% |
| 4 | 454 | 0.3% |
| 68 | 354 | 0.3% |
| 7 | 310 | 0.2% |
| Other values (6) | 368 | 0.3% |
| (Missing) | 64325 |
| Value | Count | Frequency (%) |
| 1 | 48471 | |
| 2 | 9783 | 7.0% |
| 3 | 887 | 0.6% |
| 4 | 454 | 0.3% |
| 5 | 84 | 0.1% |
| 6 | 17 | < 0.1% |
| 7 | 310 | 0.2% |
| 9 | 3677 | 2.6% |
| 18 | 526 | 0.4% |
| 28 | 123 | 0.1% |
| Value | Count | Frequency (%) |
| 99 | 10056 | |
| 78 | 789 | 0.6% |
| 68 | 354 | 0.3% |
| 58 | 4 | < 0.1% |
| 48 | 122 | 0.1% |
| 38 | 18 | < 0.1% |
| 28 | 123 | 0.1% |
| 18 | 526 | 0.4% |
| 9 | 3677 | 2.6% |
| 7 | 310 | 0.2% |
father_age
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 66 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.70925 |
| Minimum | 11 |
|---|---|
| Maximum | 99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 11 |
|---|---|
| 5-th percentile | 20 |
| Q1 | 26 |
| median | 31 |
| Q3 | 39 |
| 95-th percentile | 99 |
| Maximum | 99 |
| Range | 88 |
| Interquartile range (IQR) | 13 |
Descriptive statistics
| Standard deviation | 25.403633 |
|---|---|
| Coefficient of variation (CV) | 0.62402605 |
| Kurtosis | 1.2746793 |
| Mean | 40.70925 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 1.6991856 |
| Sum | 5699295 |
| Variance | 645.34455 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 99 | 21145 | 15.1% |
| 29 | 6782 | 4.8% |
| 28 | 6667 | 4.8% |
| 30 | 6661 | 4.8% |
| 31 | 6591 | 4.7% |
| 27 | 6427 | 4.6% |
| 26 | 6138 | 4.4% |
| 32 | 6122 | 4.4% |
| 33 | 5820 | 4.2% |
| 25 | 5818 | 4.2% |
| Other values (56) | 61829 |
| Value | Count | Frequency (%) |
| 11 | 1 | < 0.1% |
| 13 | 4 | < 0.1% |
| 14 | 19 | < 0.1% |
| 15 | 72 | 0.1% |
| 16 | 240 | 0.2% |
| 17 | 591 | 0.4% |
| 18 | 1312 | 0.9% |
| 19 | 2128 | |
| 20 | 2951 | |
| 21 | 3652 |
| Value | Count | Frequency (%) |
| 99 | 21145 | |
| 79 | 1 | < 0.1% |
| 76 | 1 | < 0.1% |
| 75 | 1 | < 0.1% |
| 73 | 1 | < 0.1% |
| 72 | 1 | < 0.1% |
| 71 | 2 | < 0.1% |
| 70 | 2 | < 0.1% |
| 69 | 2 | < 0.1% |
| 68 | 2 | < 0.1% |
record_weight
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| 1 | |
|---|---|
| 2 | 2021 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 140000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 137979 | |
| 2 | 2021 | 1.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 137979 | |
| 2 | 2021 | 1.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 137979 | |
| 2 | 2021 | 1.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 140000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 137979 | |
| 2 | 2021 | 1.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 140000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 137979 | |
| 2 | 2021 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 140000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 137979 | |
| 2 | 2021 | 1.4% |
| alcohol_use | apgar_1min | apgar_5min | born_alive_alive | born_alive_dead | born_dead | child_race | cigarette_use | cigarettes_per_day | day | drinks_per_week | ever_born | father_age | father_race | gestation_weeks | is_male | month | mother_age | mother_married | mother_race | mother_residence_state | plurality | record_weight | source_year | state | wday | weight_gain_pounds | weight_pounds | year | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| alcohol_use | 1.000 | 0.069 | 0.016 | 0.000 | 0.000 | 0.012 | 0.146 | 1.000 | 1.000 | 0.000 | 1.000 | 0.069 | 0.145 | 0.147 | 0.016 | 0.001 | 0.007 | 0.111 | 0.180 | 0.046 | 0.086 | 0.010 | 1.000 | 0.083 | 0.086 | 0.004 | 0.059 | 0.100 | 0.083 |
| apgar_1min | 0.069 | 1.000 | 0.226 | 0.027 | -0.013 | -0.000 | 0.655 | 0.000 | NaN | -0.008 | -0.035 | -0.023 | -0.014 | -0.026 | -0.111 | 0.000 | -0.003 | 0.052 | 0.070 | -0.031 | 0.022 | 0.017 | 1.000 | 0.637 | 0.000 | -0.008 | 0.011 | 0.007 | 0.637 |
| apgar_5min | 0.016 | 0.226 | 1.000 | 0.034 | -0.006 | -0.025 | -0.093 | 0.030 | 0.026 | -0.012 | -0.040 | 0.038 | -0.015 | -0.029 | 0.105 | 0.000 | -0.004 | -0.008 | 0.032 | -0.020 | 0.106 | 0.017 | 1.000 | -0.088 | 0.131 | -0.004 | -0.008 | 0.100 | -0.088 |
| born_alive_alive | 0.000 | 0.027 | 0.034 | 1.000 | 0.090 | 0.150 | 0.045 | 0.000 | 0.143 | 0.012 | 0.026 | 0.978 | 0.179 | -0.003 | -0.082 | 0.000 | 0.006 | 0.373 | 0.009 | 0.041 | 0.000 | 0.000 | 0.056 | 0.019 | 0.018 | -0.004 | -0.095 | 0.052 | 0.019 |
| born_alive_dead | 0.000 | -0.013 | -0.006 | 0.090 | 1.000 | 0.059 | 0.000 | 0.000 | 0.033 | 0.004 | 0.055 | 0.174 | 0.029 | 0.025 | -0.017 | 0.005 | -0.001 | 0.046 | 0.011 | 0.037 | 0.049 | 0.000 | 0.085 | -0.051 | 0.059 | 0.000 | -0.023 | -0.020 | -0.051 |
| born_dead | 0.012 | -0.000 | -0.025 | 0.150 | 0.059 | 1.000 | 0.049 | 0.016 | 0.044 | -0.009 | 0.074 | 0.198 | 0.079 | -0.001 | -0.043 | 0.005 | -0.005 | 0.184 | 0.010 | 0.006 | 0.051 | 0.000 | 0.085 | 0.069 | 0.060 | 0.009 | -0.033 | 0.002 | 0.069 |
| child_race | 0.146 | 0.655 | -0.093 | 0.045 | 0.000 | 0.049 | 1.000 | 0.000 | NaN | 0.008 | -0.229 | -0.001 | 0.215 | 0.374 | -0.200 | 0.000 | 0.003 | 0.022 | 0.428 | 0.433 | 0.170 | 0.000 | 0.245 | 0.675 | 0.302 | -0.010 | 0.001 | -0.094 | 0.675 |
| cigarette_use | 1.000 | 0.000 | 0.030 | 0.000 | 0.000 | 0.016 | 0.000 | 1.000 | 1.000 | 0.000 | 1.000 | 0.069 | 0.167 | 0.138 | 0.022 | 0.003 | 0.000 | 0.129 | 0.188 | 0.052 | 0.000 | 0.012 | 1.000 | 0.006 | 0.045 | 0.006 | 0.062 | 0.106 | 0.006 |
| cigarettes_per_day | 1.000 | NaN | 0.026 | 0.143 | 0.033 | 0.044 | NaN | 1.000 | 1.000 | NaN | 0.022 | 0.099 | -0.025 | -0.092 | -0.019 | 0.000 | -0.017 | 0.077 | 0.056 | -0.183 | 0.259 | 0.000 | 1.000 | -0.102 | 0.259 | 0.004 | -0.029 | -0.040 | -0.102 |
| day | 0.000 | -0.008 | -0.012 | 0.012 | 0.004 | -0.009 | 0.008 | 0.000 | NaN | 1.000 | NaN | 0.004 | -0.002 | 0.001 | -0.015 | 0.000 | 0.015 | -0.015 | 0.000 | 0.011 | 0.000 | 0.011 | 0.010 | 0.004 | 0.000 | NaN | NaN | -0.002 | 0.004 |
| drinks_per_week | 1.000 | -0.035 | -0.040 | 0.026 | 0.055 | 0.074 | -0.229 | 1.000 | 0.022 | NaN | 1.000 | 0.013 | 0.079 | 0.040 | -0.027 | 0.004 | 0.003 | 0.092 | 0.000 | 0.159 | 0.084 | 0.000 | 1.000 | -0.269 | 0.124 | 0.027 | -0.004 | 0.023 | -0.269 |
| ever_born | 0.069 | -0.023 | 0.038 | 0.978 | 0.174 | 0.198 | -0.001 | 0.069 | 0.099 | 0.004 | 0.013 | 1.000 | 0.189 | -0.006 | -0.091 | 0.002 | -0.000 | 0.355 | 0.056 | 0.024 | 0.058 | 0.037 | 0.070 | -0.006 | 0.069 | -0.004 | -0.104 | 0.041 | -0.006 |
| father_age | 0.145 | -0.014 | -0.015 | 0.179 | 0.029 | 0.079 | 0.215 | 0.167 | -0.025 | -0.002 | 0.079 | 0.189 | 1.000 | 0.456 | -0.056 | 0.006 | 0.007 | 0.392 | 0.580 | 0.179 | 0.050 | 0.027 | 0.049 | 0.009 | 0.103 | 0.004 | -0.041 | -0.031 | 0.009 |
| father_race | 0.147 | -0.026 | -0.029 | -0.003 | 0.025 | -0.001 | 0.374 | 0.138 | -0.092 | 0.001 | 0.040 | -0.006 | 0.456 | 1.000 | -0.052 | 0.000 | 0.008 | -0.217 | 0.457 | 0.631 | 0.013 | 0.000 | 0.071 | 0.029 | 0.000 | -0.000 | -0.049 | -0.157 | 0.029 |
| gestation_weeks | 0.016 | -0.111 | 0.105 | -0.082 | -0.017 | -0.043 | -0.200 | 0.022 | -0.019 | -0.015 | -0.027 | -0.091 | -0.056 | -0.052 | 1.000 | 0.011 | -0.001 | -0.063 | 0.056 | -0.060 | 0.018 | 0.109 | 0.101 | -0.056 | 0.033 | -0.002 | 0.055 | 0.379 | -0.056 |
| is_male | 0.001 | 0.000 | 0.000 | 0.000 | 0.005 | 0.005 | 0.000 | 0.003 | 0.000 | 0.000 | 0.004 | 0.002 | 0.006 | 0.000 | 0.011 | 1.000 | 0.000 | 0.008 | 0.004 | 0.007 | 0.000 | 0.000 | 0.000 | 0.001 | 0.007 | 0.005 | 0.025 | 0.110 | 0.001 |
| month | 0.007 | -0.003 | -0.004 | 0.006 | -0.001 | -0.005 | 0.003 | 0.000 | -0.017 | 0.015 | 0.003 | -0.000 | 0.007 | 0.008 | -0.001 | 0.000 | 1.000 | 0.003 | 0.015 | 0.009 | 0.017 | 0.000 | 0.003 | -0.004 | 0.021 | -0.000 | -0.019 | -0.007 | -0.004 |
| mother_age | 0.111 | 0.052 | -0.008 | 0.373 | 0.046 | 0.184 | 0.022 | 0.129 | 0.077 | -0.015 | 0.092 | 0.355 | 0.392 | -0.217 | -0.063 | 0.008 | 0.003 | 1.000 | 0.438 | -0.091 | 0.034 | 0.043 | 0.067 | 0.072 | 0.071 | 0.004 | -0.025 | 0.087 | 0.072 |
| mother_married | 0.180 | 0.070 | 0.032 | 0.009 | 0.011 | 0.010 | 0.428 | 0.188 | 0.056 | 0.000 | 0.000 | 0.056 | 0.580 | 0.457 | 0.056 | 0.004 | 0.015 | 0.438 | 1.000 | 0.077 | 0.064 | 0.030 | 0.055 | 0.093 | 0.053 | 0.026 | 0.081 | 0.118 | 0.093 |
| mother_race | 0.046 | -0.031 | -0.020 | 0.041 | 0.037 | 0.006 | 0.433 | 0.052 | -0.183 | 0.011 | 0.159 | 0.024 | 0.179 | 0.631 | -0.060 | 0.007 | 0.009 | -0.091 | 0.077 | 1.000 | 0.107 | 0.000 | 0.029 | -0.035 | 0.053 | -0.000 | -0.047 | -0.146 | -0.035 |
| mother_residence_state | 0.086 | 0.022 | 0.106 | 0.000 | 0.049 | 0.051 | 0.170 | 0.000 | 0.259 | 0.000 | 0.084 | 0.058 | 0.050 | 0.013 | 0.018 | 0.000 | 0.017 | 0.034 | 0.064 | 0.107 | 1.000 | 0.000 | 0.350 | 0.159 | 0.997 | 0.025 | 0.009 | 0.040 | 0.159 |
| plurality | 0.010 | 0.017 | 0.017 | 0.000 | 0.000 | 0.000 | 0.000 | 0.012 | 0.000 | 0.011 | 0.000 | 0.037 | 0.027 | 0.000 | 0.109 | 0.000 | 0.000 | 0.043 | 0.030 | 0.000 | 0.000 | 1.000 | 0.006 | 0.012 | 0.000 | 0.003 | 0.044 | 0.183 | 0.012 |
| record_weight | 1.000 | 1.000 | 1.000 | 0.056 | 0.085 | 0.085 | 0.245 | 1.000 | 1.000 | 0.010 | 1.000 | 0.070 | 0.049 | 0.071 | 0.101 | 0.000 | 0.003 | 0.067 | 0.055 | 0.029 | 0.350 | 0.006 | 1.000 | 0.933 | 0.348 | 1.000 | 1.000 | 0.020 | 0.933 |
| source_year | 0.083 | 0.637 | -0.088 | 0.019 | -0.051 | 0.069 | 0.675 | 0.006 | -0.102 | 0.004 | -0.269 | -0.006 | 0.009 | 0.029 | -0.056 | 0.001 | -0.004 | 0.072 | 0.093 | -0.035 | 0.159 | 0.012 | 0.933 | 1.000 | 0.316 | -0.001 | -0.071 | -0.018 | 1.000 |
| state | 0.086 | 0.000 | 0.131 | 0.018 | 0.059 | 0.060 | 0.302 | 0.045 | 0.259 | 0.000 | 0.124 | 0.069 | 0.103 | 0.000 | 0.033 | 0.007 | 0.021 | 0.071 | 0.053 | 0.053 | 0.997 | 0.000 | 0.348 | 0.316 | 1.000 | 0.044 | 0.044 | 0.083 | 0.316 |
| wday | 0.004 | -0.008 | -0.004 | -0.004 | 0.000 | 0.009 | -0.010 | 0.006 | 0.004 | NaN | 0.027 | -0.004 | 0.004 | -0.000 | -0.002 | 0.005 | -0.000 | 0.004 | 0.026 | -0.000 | 0.025 | 0.003 | 1.000 | -0.001 | 0.044 | 1.000 | 0.011 | -0.008 | -0.001 |
| weight_gain_pounds | 0.059 | 0.011 | -0.008 | -0.095 | -0.023 | -0.033 | 0.001 | 0.062 | -0.029 | NaN | -0.004 | -0.104 | -0.041 | -0.049 | 0.055 | 0.025 | -0.019 | -0.025 | 0.081 | -0.047 | 0.009 | 0.044 | 1.000 | -0.071 | 0.044 | 0.011 | 1.000 | 0.131 | -0.071 |
| weight_pounds | 0.100 | 0.007 | 0.100 | 0.052 | -0.020 | 0.002 | -0.094 | 0.106 | -0.040 | -0.002 | 0.023 | 0.041 | -0.031 | -0.157 | 0.379 | 0.110 | -0.007 | 0.087 | 0.118 | -0.146 | 0.040 | 0.183 | 0.020 | -0.018 | 0.083 | -0.008 | 0.131 | 1.000 | -0.018 |
| year | 0.083 | 0.637 | -0.088 | 0.019 | -0.051 | 0.069 | 0.675 | 0.006 | -0.102 | 0.004 | -0.269 | -0.006 | 0.009 | 0.029 | -0.056 | 0.001 | -0.004 | 0.072 | 0.093 | -0.035 | 0.159 | 0.012 | 0.933 | 1.000 | 0.316 | -0.001 | -0.071 | -0.018 | 1.000 |
| source_year | year | month | day | wday | state | is_male | child_race | weight_pounds | plurality | apgar_1min | apgar_5min | mother_residence_state | mother_race | mother_age | gestation_weeks | lmp | mother_married | mother_birth_state | cigarette_use | cigarettes_per_day | alcohol_use | drinks_per_week | weight_gain_pounds | born_alive_alive | born_alive_dead | born_dead | ever_born | father_race | father_age | record_weight | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2005 | 2005 | 1 | NaN | 3.0 | NaN | True | NaN | 7.804364 | 1.0 | NaN | NaN | NaN | 38.0 | 30 | 40.0 | 4172004 | True | NaN | NaN | NaN | NaN | NaN | 99.0 | 5.0 | 2.0 | 1.0 | 8.0 | 38.0 | 24 | 1 |
| 1 | 2005 | 2005 | 2 | NaN | 7.0 | NaN | False | NaN | 5.374870 | 2.0 | NaN | 8.0 | NaN | 1.0 | 29 | 38.0 | 5992004 | True | NaN | False | NaN | False | NaN | 8.0 | 9.0 | 0.0 | 0.0 | 10.0 | 1.0 | 31 | 1 |
| 2 | 2005 | 2005 | 5 | NaN | 4.0 | NaN | False | NaN | 6.437498 | 1.0 | NaN | 7.0 | NaN | NaN | 24 | 39.0 | 99999999 | True | NaN | NaN | NaN | NaN | NaN | 99.0 | NaN | NaN | NaN | NaN | NaN | 31 | 1 |
| 3 | 2005 | 2005 | 8 | NaN | 4.0 | NaN | False | NaN | 6.560957 | 1.0 | NaN | 9.0 | NaN | NaN | 26 | NaN | 99999999 | False | NaN | NaN | NaN | NaN | NaN | 25.0 | NaN | NaN | NaN | NaN | NaN | 99 | 1 |
| 4 | 2005 | 2005 | 5 | NaN | 6.0 | NaN | True | NaN | 8.811877 | 1.0 | NaN | 9.0 | NaN | NaN | 22 | 40.0 | 99999999 | False | NaN | NaN | NaN | NaN | NaN | 99.0 | NaN | NaN | NaN | NaN | NaN | 33 | 1 |
| 5 | 2005 | 2005 | 5 | NaN | 4.0 | NaN | False | NaN | 6.124442 | 1.0 | NaN | NaN | NaN | 7.0 | 28 | 39.0 | 8112004 | True | NaN | NaN | NaN | NaN | NaN | 99.0 | 0.0 | 0.0 | 0.0 | 1.0 | 7.0 | 33 | 1 |
| 6 | 2005 | 2005 | 10 | NaN | 6.0 | NaN | True | NaN | 6.876218 | 1.0 | NaN | 9.0 | NaN | NaN | 30 | 40.0 | 1112005 | True | NaN | NaN | NaN | NaN | NaN | 25.0 | 0.0 | 0.0 | 1.0 | 1.0 | NaN | 50 | 1 |
| 7 | 2005 | 2005 | 10 | NaN | 2.0 | NaN | True | NaN | 6.757168 | 1.0 | NaN | 9.0 | NaN | NaN | 19 | 38.0 | 1102005 | False | NaN | NaN | NaN | NaN | NaN | 25.0 | 0.0 | 0.0 | 0.0 | 1.0 | NaN | 24 | 1 |
| 8 | 2005 | 2005 | 10 | NaN | 2.0 | NaN | False | NaN | 8.688418 | 1.0 | NaN | 9.0 | NaN | NaN | 27 | 39.0 | 1192005 | False | NaN | NaN | NaN | NaN | NaN | 47.0 | 0.0 | 0.0 | 0.0 | 1.0 | NaN | 30 | 1 |
| 9 | 2005 | 2005 | 9 | NaN | 6.0 | NaN | False | NaN | 6.999677 | 1.0 | NaN | 9.0 | NaN | NaN | 20 | 40.0 | 12142004 | False | NaN | NaN | NaN | NaN | NaN | 42.0 | 0.0 | 0.0 | 0.0 | 1.0 | NaN | 30 | 1 |
| source_year | year | month | day | wday | state | is_male | child_race | weight_pounds | plurality | apgar_1min | apgar_5min | mother_residence_state | mother_race | mother_age | gestation_weeks | lmp | mother_married | mother_birth_state | cigarette_use | cigarettes_per_day | alcohol_use | drinks_per_week | weight_gain_pounds | born_alive_alive | born_alive_dead | born_dead | ever_born | father_race | father_age | record_weight | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 139990 | 1970 | 1970 | 3 | 21.0 | NaN | AR | False | 1.0 | 7.187070 | NaN | NaN | NaN | AR | 1.0 | 28 | NaN | 88881908 | True | AR | NaN | NaN | NaN | NaN | NaN | 3.0 | 0.0 | 1.0 | 5.0 | 1.0 | 31 | 2 |
| 139991 | 1970 | 1970 | 9 | 15.0 | NaN | AR | False | 1.0 | 6.937947 | NaN | NaN | NaN | TN | 1.0 | 23 | NaN | 88881908 | True | TN | NaN | NaN | NaN | NaN | NaN | 3.0 | 0.0 | 0.0 | 4.0 | 1.0 | 33 | 2 |
| 139992 | 1970 | 1970 | 6 | 19.0 | NaN | AR | True | 1.0 | 7.500126 | NaN | NaN | NaN | AR | 1.0 | 42 | NaN | 88881908 | True | Foreign | NaN | NaN | NaN | NaN | NaN | 3.0 | 0.0 | 0.0 | 4.0 | 1.0 | 47 | 2 |
| 139993 | 1970 | 1970 | 7 | 27.0 | NaN | AR | True | 2.0 | 6.686620 | NaN | NaN | NaN | AR | 2.0 | 28 | NaN | 88881908 | True | AR | NaN | NaN | NaN | NaN | NaN | 6.0 | 0.0 | 0.0 | 7.0 | 2.0 | 54 | 2 |
| 139994 | 1971 | 1971 | 9 | 3.0 | NaN | AR | True | 1.0 | 8.624484 | 1.0 | NaN | NaN | AR | 1.0 | 20 | NaN | 88881918 | True | CA | NaN | NaN | NaN | NaN | NaN | 0.0 | 0.0 | 0.0 | 1.0 | 1.0 | 21 | 2 |
| 139995 | 1971 | 1971 | 11 | 10.0 | NaN | AR | False | 1.0 | 6.375769 | 1.0 | NaN | NaN | AR | 1.0 | 19 | NaN | 88881918 | True | AR | NaN | NaN | NaN | NaN | NaN | 0.0 | 0.0 | 0.0 | 1.0 | 1.0 | 21 | 2 |
| 139996 | 1971 | 1971 | 12 | 20.0 | NaN | AR | False | 1.0 | 7.251004 | 1.0 | NaN | NaN | AR | 1.0 | 19 | NaN | 88881918 | True | AR | NaN | NaN | NaN | NaN | NaN | 0.0 | 0.0 | 0.0 | 1.0 | 1.0 | 22 | 2 |
| 139997 | 1971 | 1971 | 6 | 14.0 | NaN | AR | False | 1.0 | 6.624891 | 1.0 | NaN | NaN | AR | 1.0 | 20 | NaN | 88881918 | True | AR | NaN | NaN | NaN | NaN | NaN | 0.0 | 0.0 | 0.0 | 1.0 | 1.0 | 26 | 2 |
| 139998 | 1971 | 1971 | 5 | 18.0 | NaN | AR | True | 1.0 | 6.437498 | 1.0 | NaN | NaN | AR | 1.0 | 18 | NaN | 88881918 | True | AR | NaN | NaN | NaN | NaN | NaN | 0.0 | 0.0 | 0.0 | 1.0 | 1.0 | 20 | 2 |
| 139999 | 1971 | 1971 | 4 | 17.0 | NaN | AR | True | 1.0 | 6.937947 | 1.0 | NaN | NaN | AR | 1.0 | 19 | NaN | 88881918 | True | AR | NaN | NaN | NaN | NaN | NaN | NaN | 0.0 | 0.0 | NaN | 1.0 | 29 | 2 |